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ABSTRACT 

We present an accurate and fast framework for generating mock catalogues including 
low-mass halos, based on an implementation of the COmoving Lagrangian Accel¬ 
eration (COLA) technique. Multiple realisations of mock catalogues are crucial for 
analyses of large-scale structure, but conventional 7V-body simulations are too com¬ 
putationally expensive for the production of thousands of realisations. We show that 
COLA simulations can produce accurate mock catalogues with a moderate compu¬ 
tation resource for low- to intermediate- mass galaxies in haloes, both in 

real and redshift space. COLA simulations have accurate peculiar velocities, with¬ 
out systematic errors in the velocity power spectra for k ^ 0.15hMpc“^, and with 
only 3-per-cent error for k ^ 0.2/iMpc“^. We use COLA with 10 time steps and a 
Halo Occupation Distribution to produce 600 mock galaxy catalogues of the WiggleZ 
Dark Energy Survey. Our parallelized code for efficient generation of accurate halo 
catalogues is publicly available at github.com/junkoda/cola_halo. 

Key words: cosmology: theory - large-scale structure of Universe - methods: nu¬ 
merical. 


1 INTRODUCTION 


Generating multiple realisations of mock galaxy catalogues 
is essential for analysing large-scale structure in the Uni¬ 
verse. It is a necessary tool for evaluating the statistical un¬ 
certainties in the clustering measurements, and systematic 
errors in theoretical modelling and data analysis. The im¬ 
portance of accurate mock catalogues is increasing as data 
analyses become more complicated and sophisticated, and 
the large-scale-structure measurements become more pre¬ 
cise. 


One of the targets of cosmological surveys is the Baryon 
Acoustic Oscillation (BAO) feature imprinted in the galaxy 
clustering (Cole et al. 2005| 


Eisenstein et al. 2005 


Blake 


et al.|[^ll| Beutler et al.||20T2 Anderson et ai.||2014 K It a 

‘standard ruler’ that provides robust measurements of the 
expansion history of the Universe through the cosmological 
distances as a function of redshift. The data analysis proce¬ 
dure was recently refined by the ‘reconstruction’ technique 
(Eisenstein et al. 20071, which improves the precision by 


sharpening the BAO peak by rewinding the large-scale dis¬ 
placements in part. This technique was first applied to the 


* E-mail: jun.koda@brera.inaf.it 


Sloan Digital Sky Survey Data Release 7 (Padmanabhan 
|et al.|[^12[ [Mehta et al.||2012[), and has become a standard 


procedure ( Anderson et al.||2012 2014| Kazin et al.|2014|. 


Covariance matrices, e.g, Cij = (^(ri)^(rj)) — (^(ri)) 
{^{rj)), for two-point correlation function ^(r), need to be 
calculated for any analyses of large-scale structure to evalu¬ 
ate the best-fitting cosmological parameters and their con¬ 
fidence regions. The ensemble averages for the covariance 
matrix can be computed directly from many realisations of 
mock galaxies. The benefit of multiple realisations of mock 
galaxy catalogues to build the covariance matrix is not lim¬ 
ited to BAO, but the preference of using mocks over other 
methods is clear for BAO due to its large length scale of 150 
Mpc and non-trivial numerical process in the reconstruction. 
Mock galaxy catalogues based on simulations can properly 
evaluate the error caused by imperfect reconstruction due 
to non-linear motions and realistic selection function. Alter¬ 
native methods like jack-knife sampling work for measure¬ 
ments on small scales, but we often do not have enough 
quasi-independent subvolumes assumed for jack-knife sam¬ 
pling on BAO scales. Log-Normal realisations of the galaxy 


density field (Coles & Jones 19911 can provide many samples 


of large-scale fields, but non-linear dynamics is not accurate; 
one of the sources of uncertainties we would like to evalu- 
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ate for the BAO measurement is the amount of non-linear 
motion that is not completely captured by the rewinding in 
the reconstruction algorithm. 

Running many A^-body simulations for multiple realisa¬ 
tions is ideal, but it requires a large amount of computation 
time on a massively-parallel supercomputer. An insufficient 
number of mock catalogues would give biased error evalua¬ 
tion; even with 600 mocks, careful treatment is necessary to 
evaluate the inverse covariance matrix ( [Hartlap et al.|2007[ 
Percival et al.|[^014 K Simulations will be harder as the sur¬ 
vey volume becomes larger (BigBOSS Schlegel et al.|2009 |, 
and the resolution required to resolve galaxy-hosting haloes 
become higher, e.g., for emission-line galaxies in HETDEX 


(Hill et al. 20041, Euclid (Amendola et al. 20131, or Fast- 
Sound ([Tonegawa et al.||2015[) surveys, or for less-luminous 


galaxies in deeper surveys, GAMA (Driver et al. 20111 or 
VIPERS ( Garilli et al.|[20l4 l]. Manera et al. (20131 gener¬ 


ated 600 mock galaxy catalogues, ‘PThalos’, for the Baryon 
Oscillation Spectroscopic Survey (BOSS), using the 2nd- 
order perturbation theory and Friends-of-Friends halo finder 
| | Davis et al. ]TM5| ). Theoretical ideas of fast simulations us¬ 


ing analytical theories exist a decade ago ( Scoccimarro fc 
Sheth|[2002 Monaco et al.||2002 ), or even earlier (adhesion 


approximation, Gurbatov et al.|1989l, but such research at¬ 


tracted attention after the practical application to BOSS 


(Monaco et al. [2013 |de la Torre & Peacock 2013| White 

et al. 

2014 

Kitaura et al.|2014||Angulo et al.|2014 [Chuang 

et al. 

2015a 

Avila et al.|2015 I. See Chuang et al. 

(2015b 

1 for 


a comparison of these methods. Some of these recent meth¬ 
ods randomly generate haloes, instead of resolving haloes, 
using a probability that depends on the local dark matter 
density. 

We generate 600 mock galaxy catalogues for the Wig- 


gleZ Dark Energy Survey (Drinkwater et al. 20101 for the 


improved BAO measurement using the reconstruction tech¬ 


nique (Kazin et al. 20141 and for other analyses (Burrage 
et al.|20l5 Beutler et al.|20l5 Marin et al.|2015 1. The Wig- 
gleZ samples are emission-line galaxies in dark-matter haloes 
of masses approximately IO^^Mq, which is about an order 
of magnitude smaller than the haloes hosting the BOSS 
CMASS galaxies. We use the COmoving Lagrangian Accel¬ 
eration (COLA) ( [Tassev et al. 20131 method to run many 
realisations of simulation, after hnding that the PTHalo 


method by Manera et al. ( 2013 1 was not able to resolve 
IO^^Mq haloes (see Section 3.11. In this paper, we present 


the accuracy of COLA mocks on large scales relevant to cos¬ 
mological analyses, which was not tested with the small sim¬ 


ulation box by Tassev et al. (2013), and show that COLA is 


accurate not only for massive CMASS-like galaxies (Chuang 
et al.|[^15b l but for lower-mass galaxies. COLA is becom¬ 
ing a common tool when a large number of simulations is 
required ( Howlett et al.|2015a|b Leclercq et al.|[^015 ). 

This paper is organised as follows. We first review the 
COLA algorithm, and describe our COLA simulations for 
the WiggleZ survey in Section and compare our simu¬ 
lations with conventional A-body simulations in Section]^ 
We describe our mock galaxy catalogue based on COLA in 
Section and compare the mock galaxies with those based 
on conventional simulations in Section Throughout the 
paper, we use a flat ACDM cosmology with Qm = 0.273, 
Qa = 0.727, Qb = 0.0456, h = 0.705, as = 0.812, and 


ris = 0.961, which is the WMAP5 cosmology (Komatsu 


et al. 2009 

1 used for the Gigaparsec WiggleZ simulation 

Poole et al. 

(2015). 


2 COLA SIMULATION 

We use the COmoving Lagrangian Acceleration (COLA) 
method invented by [Tassev et al.| ( |2013| TZE hereafter) 
to run many realisations of cosmological simulations with 
a reasonable amount of computation time. COLA enables 
a reduction in the number of time steps by combining 2nd- 
order Lagrangian Perturbation Theory (2LPT) and A-body 
simulation. 


2.1 Introduction to the COLA algorithm 

A typical time-evolution method for A-body simulation is 
the leapfrog integration: 


Xi+i =Xi + Ui+i/aAt 

lIi+l/2 = Uj-l/2 + F{Xi)At 


( 1 ) 

( 2 ) 


where Xi (i = 0,1, 2,...) is the position of a particle at time 
ti = iAt, Vi+i /2 is the velocity at ti+i /2 = (i-|-l/2) At, and 
F{x) is the acceleration at x, for some time step At. [The 
equations are solely for illustrating the difference between 
the conventional leapfrog integration and COLA; terms for 
the expanding Universe are dropped. See, e.g., [Quinn et alT] 
(19971 for the leapfrog time stepping for cosmological sim¬ 


ulations.] The leapfrog integration is accurate up to second 
order in At, but the truncation error from higher orders in 
At makes the time evolution inaccurate for large At. In ad¬ 
dition, the time step is usually proportional to the Hubble 
time H~^{t) to integrate accurately in cosmological simu¬ 
lations, which is smaller at higher redshifts. Since we can 
approximate the motion well by 2LPT at high redshifts, we 
can use larger time steps at high redshifts with COLA than 
the conventional leapfrog integration. 

COLA has two techniques that improve the accuracy 
of time integration for large time steps. First, it uses the 
discrete time evolution only for the non-linear terms beyond 
2LPT, i.e., the residual particle position, velocity, and ac¬ 
celeration from their 2LPT contributions: 


aji-es = a: - a; 2 LPT(t), (3) 

iTi-es = I’- a:2LPT(t), (4) 

Fres = F{x) — i 2 LPT(t), (5) 

where the dots are time derivatives, and, 

a: 2 LPT(t) = q + Di(f)^^^'(q) -|- D 2 {t)^^^\q), (6) 

is the growing-mode solution of 2LPT, mapping the ini¬ 
tial comoving position q to a later position at time t. The 
time evolution is given by the linear growth factor D\(t), 
and the second-order growth factor D 2 {t), which is approx¬ 
imately D 2 (t) = —|Di(f)^H(a(f))“^^^"^®, where f2(o) = 
Hm(Hm + is the H matter at scale factor a (see. 


^ The public 2LPTlC code (footnote 3), originally designed to 
generate initial conditions at high redshifts, does not contain the 
factor which is negligible at high redshift. We correctly 

include this factor. The effect, however, is negligible, only a sub- 
per cent contribution to the second order at all redshifts. 
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[Bouchet et al.|[i995| [Bernardeau et aTT]|2002[ and references 
therein for 2LPT). The hrst- and second-order motions are 
integrated analytically with 2LPT, which does not have the 
truncation error for discrete At. 

The second component is an ansatz that the residual 
velocity decays as, 


V (t) 


- V 


res 

i + 1/2 


aid) 

‘(fi+l/2). 


, '^LPT 


for ti ^ t ^ ti+i during a drift step Xi i— Xi+i , and 


^ Ai + Bia{ty 


( 7 ) 


( 8 ) 


for ti-i /2 ^ t SS ti+i /2 during a kick step, <-1/2 ^ <+1/2, 
where Ai and Bi are constants, a is the scale factor, and 
riLPT = —2.5 is a free parameter. These functions replace 
the linear functions of At in equations ([l|^, and suppress 
the higher-order terms. (Note that the growing mode is cap¬ 
tured by 2LPT, and the residual term is a decaying mode — 
at least in the linear perturbation theory.) The two ansatz 
are empirical, and not exactly consistent with each other; 
equation Q is assuming that Ai is negligible compared 


to the second term with Bi in equation tek. In fact, TZE 


suggest another ansatz, 8-s a replacement 

for equation 0 for simulations starting at high redshift 
2 ~ 49 with low mass resolution, which is the other limit 
that Bia{t)^^^^ is negligible compared to Ai. The opti¬ 
mum ansatz, including the value of ulpt, depends on the 
redshift and resolution. ‘Experimentation is always advised 


with COLA’ (TZEI. 


2.2 Basic equations 

We briefly review the equations of motion of dark matter 
particles in the expanding Universe, and then present the 
COLA time evolution equations (see also the original de¬ 
scription by TZE I. Let x be the comoving coordinate of 


an A-body particle, and v = be its canonical velocity. 
The canonical velocity, v = m~^dL/dx, follows from the 
Lagrangian, 


1 2 

L =-m{ax) —m4>{x,t), 


( 9 ) 


where m is the particle mass, ax is the physical peculiar 
velocity, and (j) is the peculiar gravitational potential that 
satisfies the Poisson equation in the physical coordinate 
Vphys = V/a: 


) (l>{x,t) = ATiG[p{x,t) - p{t)\, (10) 


for the matter density p and mean matter density p. This 
can be written as. 


V^(j>{x,t) = ^{t)5{x,t), 


( 11 ) 


using the density contrast 5 = p/p — 1, the present critical 
density Pcrit.o = 3Hq/{StvG), Hubble constant Hq, and the 
present matter density Qm = p/pcrit,o- The Euler-Lagrange 
equation gives the equations of motion. 


X = v/a{t) 


( 12 ) 


V = m ^dL/dx —<f){x,t) = F{x,t)- (13) 


We discretize the time into Uatep = 10 steps, uniformly in a 
between scale factor 0 and 1, 


a{ti^ — — ^/Ustep, 

fl(U+l/ 2 ) = Cli+ 1/2 = (j -f l/2)/nstep 


and set the initial condition. 


x^%ti) = 0, 


^'■"“(fl/2) = 0, 


(14) 

(15) 

(16) 


which means that the position and the velocity are exactly 
equal to those of 2LPT. This is slightly different from |TZ^ 
they set the initial condition at a = 0.1 for both the position 
and the velocity, and divide the scale factor by 10 between 
0.1 and 1. (Our time stepping is ’9 steps’ in their language.) 
Even though setting initial velocity at ti /2 is natural for 
leapfrog integration, we find that this causes 2-3 per cent 
excess in matter power spectrum at fc ~ 0.2hMpc“^; the 
original |TZE| initial condition may be more accurate. We 
present the results of the original initial condition in Ap¬ 
pendix]^ 

The ansatz for the drift step (equation and one of 
the equations of motion (equation |12| give. 


res /, \ res , res 

X [t) ^Xi -f Ui+i/2 


a 


fli+l/2 


dt' 

W? 


(17) 


for ^ t ^ ti+i. We compute the integral numerically, 
which is common for all particles. The time evolution during 
the kick step (equation]^ is. 


*(l) — t’i-l/2 + ' 


lity 


i-l/2 


yxy. (18) 


riLPT a{tiYLPT-^ a{ti) 

for ti_i /2 SS t ^ ti+i/ 2 ', the constants Ai and Bi in equa¬ 
tion (|^ are set by matching the velocity at t = ti_i/ 2 , 

F^yu-1/2) = VT<,^, (19) 

and the acceleration at t = ti, 

h’''’°(ti) = BiULPT a{tiY^^'^~^ h(ti) = F'"^"‘{xi). (20) 

We use equations ( 17|18 \ to update the A-body particle po¬ 
sitions and velocities Xi i—>■ Xi+i, Vi_x /2 I’i+i/z, and also 
to interpolate the quantities between timesteps for snapshots 
at redshifts of our interest. 

2.3 The WiggfeZ COLA (WiZ-COLA) simulation 

The WiggleZ-COLA (WiZ-COLA) simulation is a set of 
COLA simulations designed for the WiggleZ Dark Energy 
Survey ( Drinkwater et aL]|2010 ( to quantify the systematic 
and statistical errors in data analyses. We run 3600 COLA 
simulations with different initial random modes to generate 
600 independent realisations of mock galaxies for six survey 
regions in the sky (we use six independent realisations for the 
six regions). The WiggleZ survey is a redshift survey which 
covers about 1000 deg^ up to redshift 1. The survey volume 
consists of six regions in the sky, and analysed in three red¬ 
shift bins A^^®” (0.2 < 2 < 0.6), Az“‘^ (0.4 < 2 < 0.8), 
and Az^" (0.6 < 2 < 1.0). We use a periodic simulation box 
of 600h“^Mpc on a side to cover any one of these redshift 
bins. The mass of dark matter haloes hosting the emission¬ 
line galaxies in the Wig gleZ sample, inferred from the galaxy 
bias ( Marin et al.|2013 l, is about Mq. We use 1296® 


particles, which gives the particle mass 7.5 x Itrh Mq, to 
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Figure 1. Simulation particles in 2LPT, COLA, and GADGET simulations, from left to right, respectively, in subvolumes of 100 X 
100 X 2 (h“^Mpc)^. The red particles are particles in massive haloes, M ^ 10^^ h~^MQ, and blue particles are in low-mass haloes, 
lO'^^ h~^MQ M < 10^^ h~^MQ. 2LPT simulation can only resolve massive haloes, while COLA can resolve both massive and low- 
mass haloes. 


have more than 100 particles for haloes we need to resolve. 
This mass resolution is e qual to that of the Gigaparsec Wig- 
gleZ simulation (GiggleZ, Poole et al. 20151, which has 2160^ 
particles in a lh“^Gpc box on a side. We use (3 x 1296)® 
meshes for Particle Mesh (PM) gravitational force solver to 
resolve haloes as |TZE| suggested. 

We parallelize the publicly available serial COLA cod^ 
by |TZE| to run simulations that satisfy the volume and mass 
resolution required for the WiggleZ survey. We combine our 
parallelized COLA code with a 2LPT code, 2LPTic, [phased 
on N-GenIC|4 and a Friends-of-Friends (FoF) halo Hnder at 
N-body sho^ for efficient on-the-fly generation of halo cat¬ 
alogues. We use a parallel Fast Fourier Transform library, 
FFTW3 ( Frigo fc Johnson|2005 1 for 2LPT and PM. We fol¬ 
low the slab decomposition of FFTW, which slices the vol¬ 
ume along one axis. We divide the simulation cube into 216 
equal-volume slices, and we move N-body particles between 
volumes after each timestep using the Message Passing In¬ 
terface (MPI). We do not write the dark matter particles to 
the hard drive, we only write the halo catalogues and the 
matter density field on a grid at redshifts 0.73, 0.6, 0.44, 
and 0. The first three redshifts are the effective redshifts of 
and Az^®”, respectively. 

We use 216 cores and 4 x 216 Gbytes of random access 
memory in the Green II supercomputer at Centre for Astro¬ 
physics and Supercomputing at Swinburne University. This 
number of cores is necessary to allocate the large mesh. In 
Table we list the composition of the computation time for 
one realisation; one realisation takes about 15 minutes, and 
the majority of them (66 per cent) are used for the FFTW 
for gravity solving. Only 2 per cent of the time is used for 
2LPT. Our COLA simulations are about a factor 50 slower 
than 2LPT, but still more than 100 times faster typical N- 
body simulations, which we describe in the following section. 


^ https://bitbucket.org/tassev/colacode/ 

^ http: //cosmo.nyu.edu/roinan/2LPT/ 

* http://www.gadgetcode.org/ 

® www-hpcc.astro.washington.edu/tools/fof.html 


Table 1. Gomputation wallclock time of each procedure in one 
GOLA simulation using 216 cores. 


Procedure 

Time 

[sec] 

Fraction 
[per cent] 

2LPT 

18 

2 

EFT in GOLA 

583 

66 

Other processes in COLA 

114 

13 

Data analysis (FOF) 

167 

19 

Total 

882 

100 


3 ACCURACY OF COLA SIMULATION 


To test the accuracy of our COLA simulations, we compare 
them with simulations performed with the same number of 
particles and the same initial random modes using the pub¬ 


licly a vailable Tree-PM iV-body code GADGET-2 (Springel 
20051. For GADGET, we use 2592® PM grids and a softening 


length equal to 5 per cent of the mean particle separation. 
We use the default values of accuracy parameters; rj — 0.025 
for the time step, and a = 0.005 for the force accuracy. We 
setup the initial condition at z = 49 using the same 2LPT 
displacement fields. We make 14 realisations, and each of the 
A-body run takes about 9000 CPU hours using 384 comput¬ 
ing cores. The computation time for one realisation is about 
160 times larger than that for our COLA simulation. 


3.1 Haloes in 2LPT, COLA and GADGET 
simnlations 

In Fig. we show slices of 2LPT, COLA, and GAD¬ 
GET simulations at redshift 0.6. The red points are sim¬ 
ulation particles in ‘massive haloes’ above lO®®/i“^M 0 , and 
blue points are particles in ‘low-mass haloes’ in the range 
10 ^®/i“®Mq < M < lO^®/i“^M 0 . We identify the haloes 
with the FoF algorithm with linking length 0.2 times the 
mean particle separation {I = 0.2) for GADGET and 
COLA, and I = 0.37 for 2LPT, following the prescription 


of PTHaloes by Manera et al. (20131. The halo masses are 
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Figure 2. Linear biases of haloes grouped by their masses; each 
bin corresponds to a number density of 2.5 X 10“^(/i“^Mpc)“®. 
COLA haloes have correct bias with about 5 per cent accuracy, 
while PThalos have reasonable bias only for the most massive bin. 

based on those of the GADGET simulation. The haloes in 
the COLA and 2LPT simulations are sorted by mass in de¬ 
scending order, and the haloes are classihed to massive or 
low-mass by the ranking. The massive PThaloes are found 
in approximately correct locations, but low-mass PThaloes 
are completely mislocated; haloes in filaments are not re¬ 
solved, and the noise around massive haloes is incorrectly 
identihed as low mass haloes. The COLA simulation, on 
the other hand, is almost indistinguishable to the GADGET 
simulation; only a small number of haloes crosses the mass 
boundary of M = due to a scatter in mass. 

We can also see the problem of the PTHaloes in the 
halo bias. In Fig. we plot the linear halo bias for haloes 
grouped by their masses. Each group has a number den¬ 
sity 2.5 X 10“"^(/i“^Mpc)“^, and the corresponding mass 
range is based on the GADGET simulation. The linear bias 
is computed by matching the amplitude of the halo power 
spectrum with the matter power spectrum of MPTbreeze 


(Crocce et al. 

2012 

1 for k ^ 0.1 h ^Mpc (see more details 

in section 3.3 

for the power spectrum computation). The 


bias of PTHaloes are always above 2, because all the haloes 
are clustered around massive haloes. The biases of COLA 
haloes have correct dependence on mass, but are 5 per cent 
smaller than those of GADGET, which is probably due to 
the scatter in the halo mass. Since there are more low-bias 
haloes than high-bias haloes, the scatter introduces a larger 
fraction of low-bias haloes into the group. 

3.2 Halo mass 

In Fig.[^ we plot the halo masses of COLA and GADGET. 
For each halo in the COLA simulation, Hcola, we find the 
GADGET halo, Hgadgbt, that contains the largest number 
of halo particles in Hcola, / : Hcola i-a- Hgadget, then 


Figure 3. The relation of halo masses in the COLA simulation 
Mcola and those in the GADGET simulation Mgadget- The 
straight red lines in both panels are the linear fit, ALcola = 
0.938 Mgadget- The solid and dashed black lines in the bot¬ 
tom panel are the mean and the standard deviation of the ratio 
-^fcOLA/AfGADGETi respectively. The ratios are almost mass in¬ 
dependent, and the scatters are 0.25 - 0.30. 

we find the same mapping in the opposite direction for each 
GADGET halo, g : Hgadget Hcola- In the figure, we 
plot the masses for a subset of halo pairs that the both map¬ 
pings exist and point to each other: {(JTcola, ^gadget) : 
/(Hgola) = Hgadget and (/(^gadget) = Hgola}- 
The linear fitting gives, 

Mcola = 0.938Mgadget. (21) 

The ratio Mcola/Mgadgbt is almost independent of mass, 
except below where the artificial increase in the 

ratio is caused by the minimum halo mass of 32 particles 
per halo. The scatters in the ratios are about 0.24 above 
Mq, and increase to about 0.3 near Mq. 


3.3 Matter power spectrum 

We compare the matter power spectra of COLA with those 
of GADGET in Fig. for the 14 realisations with same ini¬ 
tial conditions. COLA is accurate within 1.4 per cent for 
k ^ 0.1/iMpc“^ and 2.5 per cent for k ^ 0.2/iMpc“^, re¬ 
spectively. The error bars are twice the standard error in the 
mean, 

AP = 2a{P)/VK (22) 

where cr(P) = X^^i(Pi — P)'^/{Nr — 1) is the standard de¬ 
viation, P = Pi/Nr is the mean, and Nr = 14 is the 

number of the realisations. The error bars for the ratio in the 
bottom panel are too small to see; cosmic variance does not 
directly affect the ratio of two simulations using the same 
initial modes. We find an excess in the power spectrum ratio. 
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Figure 4. {Upper panel:) Matter power spectra of a COLA 
simulation (points) and a GADGET simulation (lines) at 2 : = 
0, 0.44,0.6 and 0.73, which have the same initial condition. {Lower 
panel:) Ratios of COLA power spectra to those of GADGET. 
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Figure 5. (Upper panel:) Mean matter power spectrum of 3600 
COLA simulations (WiZ-COLA, black crosses) compared to a 
non-linear analytical power spectrum by MPTbreeze (cyan line) 
at 2 = 0.6. Cyan circles are the analytical power spectrum aver¬ 
aged on discrete grids in Fourier space. (Lower panel:) The ratio 
of WiZ-COLA power spectrum to the analytical power spectrum, 
both averaged on the same discrete grid in Fourier space. COLA 
simulations give very accurate overall amplitude, in agreement 
with the analytical power spectrum within statistical fluctuation. 
(The error bars are twice the standard errors in the mean.) 


fboLA/^bADGET > 1, which was not seen in the original pa¬ 
per (jrZEl; this is caused by the slight difference in the initial 
condition (equation 16 see also Appendix]^. The amount 
of error seems to fall into two groups; a group of redshifts 0 
and 0.44, and the other group of 0.60 and 0.73. This could be 
due to our interpolation between time steps (equation |17[ ). 
Redshifts 0 and 0.44 correspond to scale factor 1 and 0.694 


which are close to the drift steps (equation 141, while the 


latter group with slightly larger errors is away from those 
scale factor by about 0.025. There is probably a room for 
a small improvement in interpolation formula by adding a 
term that uses the acceleration. 

In Fig. we plot the mean matter power spectrum 
of 3600 realisations and compare with an analytical power 
spectrum from MPTbreeze (Crocce et al. p0l2| ). The long- 
wavelength modes, fc 0.1 /iMpc“^ are accurate within the 
statistical uncertainty; the for the first 9 data points, 
k O.lhMpc”^, is 7.1. We use a publicly available cod^ 
| |Taruya et al.|201^ for th e reference ’no-wiggle’ power spec¬ 
trum (Eisenstein & Hu 1998 b The good match between 


COLA and MPTbreeze near A: = 0.1 /iMpc ^ is partially due 
to a coincidence, as we see errors larger than 1 per cent in 
Fig.m Here, we highlight the accuracy in the linear growth 
factor in the matter power spectrum, which is a benefit of 
using 2LPT in COLA; 10-time step Particle Mesh simula¬ 
tions, with conventional leapfrog integration alone, have 1- 
2 per cent error in the overall power spectrum amplitude 

i [Tze| ). 


The detail of calculating the power spectrum is as fol¬ 
lows. We assign matter densities on 324^ grids using the 
Cloud in Cell (CIC) assignment, using all dark matter par¬ 
ticles on the fly, and compute the density contrast in Fourier 
space, S(k), using a Fast Fourier Transform. The FFTW li¬ 
brary provides discrete S(k) for kz 0 — modes in the 
other half of the Fourier space do not contain independent 
information due to the reality condition S(—k) = S{k)*. 
To avoid double counting of modes on the kz = 0 plane, 
we use the modes {kz > 0} U {kz = 0 and ky > 0} U 
{kz = 0 and ky — 0 and k^ > 0}. We compute the av¬ 
erages P(k) = V~^ (5(k)3*(k)) and plot against the av¬ 
erage wavenumbers (k) in bins of a fixed width Afebin = 
0.01 /iMpc“^, where V is the volume of the simulation box. 
The average (P) is not an unbiased estimate of P({k}) in 
general; P({k}) — {P(k)} is guaranteed only if P(k) is a lin¬ 
ear function of k within the bin. We, therefore, average the 
analytical power spectra on the same discrete 3-dimensional 
grid for accurate comparison, which are plotted by cyan cir¬ 
cles in Fig. This discrete averaging makes statistically 
significant differences, especially between k = 0.01 hMpc“^ 
and 0.02/iMpc“^, where the power spectrum deviates sig¬ 
nificantly from a linear function, reaching the maximum and 
turning over. We correct for the smoothing and the aliasing 
effect using the procedure by |Jing| (|2005| . 


4 MOCK GALAXY CATALOGUES 

We populate the haloes with mock galaxies using the Halo 
Occupation Distribution (HOD) prescription. 


® www2.yukawa.kyoto-u.ac.jp/atsushi.taruya/cpt_pack.html 
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4.1 Halo Occupation Distribution (HOD) for 
WiggleZ galaxies 


We use a log-normal HOD (Zehavi et al. 2005 Cai et al. 


2011) for the emission-line galaxies in the WiggleZ sample. 


We assume that the probability that a dark matter halo of 
mass M hosts a WiggleZ galaxy is, 


P{M) = exp 


(logipM-login Mq)^ 

M 


(23) 


where log^p Mo and criog m are parameters fitted against 
data. We populate at most one galaxy per halo, without 
any satellite galaxies, and set the position and velocity of 
the galaxy equal to the averages of the host halo particles 
(i.e., the centre-of-mass position and velocity). We do not 
use the error function HOD ( [Zheng et al.|[^05[ ), because 
we do not expect to find emission-line galaxies, which are 
young star-forming galaxies, in groups or clusters hosted by 
massive haloes. 

We find the two HOD parameters by matching the pro¬ 
jected correlation function. 


oirp) ~ 


/ 71 

-I 


i{rp,-K)d-K, 


(24) 


with TTmax = 60 h“^Mpc. We perform the matching by pop¬ 
ulating a series of mock catalogues using a trial set of HOD 
parameters, computing the mock mean, and comparing the 
mock mean with the data by minimizing a statistic using 
a covariance matrix obtained from jack-knife re-sampling. 
Since log Mo and niogAf are degenerate, we fix niogM = 0.1. 
We find logiQ Mo = 12.17 for A^Near and Azpar, and 12.28 
for AzMid for FoF halo mass M without any corrections (all 
masses are in units of h~^MQ). COLA halo mass is about 
7 per cent smaller than true A-body simulation mass, but 
any constant calibration factor for the mass only rescales 
the parameters without any change in the HOD mock. 

We subsample the HOD galaxies by a realisation- 
independent factor to match the smooth number density 
without clustering n{x) using the survey selection function 
(Blake et al. 2010). At low redshift, there are rare cases 


that the number of HOD galaxies is not sufficient. In such 
cases, we increase the width of the HOD criog m for M < Mo 
to match the number density, keeping the HOD same for 
M > Mq. 


4.2 HOD for BOSS CMASS galaxies 

We also generate mock catalogues for the BOSS CMASS 
galaxies in the BOSS-WiggleZ overlap volume using the 


same simulation for the multi-tracer analyses (Beutler et al. 
2015 Marin et al.|2015 ). We refer the reader to these papers 
for the detail of the overlap regions. 

We use the error function for the central galaxies, and 
a power law for the satellite galaxies. We populate at most 
one central galaxy per halo with a probability. 


P{M) = - 


1 -I- erf 


/ logio Afzoo.rn - logio 


(25) 


\ rriogAf 

is a halo mass defined by the mass within a 


where M 2 oo,n 

sphere of radius r 2 oo,m whose mean overdensity is 200 times 
the mean matter density. We denote the similar quantities 
for 200 times the critical density with M 2 oo,c and r 2 oo,c. If 


the halo has a central galaxy, we draw a number of satellite 
galaxies from a Poisson distribution with mean, 

(26) 


(Msat) = (M 200 ,m/Mof. 


with 


A satellite HOD 
[(M —Mi)/Mo]“ (Zheng et al. 2005), is 


additional parameter, 
also used fre¬ 


quently, but Ml is usually not sensitive to the clustering 
data, and does not significantly improve the fit ([Blake et al.| 

^oMl. 


We add a random offset and a random virial velocity to 
the satellite galaxy assuming a spherical [Navarro, Frenk, fcj 
White (1997) profile, 


p(r) = 


Po 


(27) 


(r/rs)(l + rjrsY' 

We can uniquely determine the 2-parameter profile by 
specifying the mass M 2 oo,c and a concentration parameter 
C 200 ,c = r' 2 oo,c/rs. We draw a random concentration param¬ 
eter from a known relation in the literature, but there are 
several trivial steps to convert the halo mass to an appro¬ 
priate one: 

(i) We first set the FoF halo mass Mpop = 1.066 Mcola, 
which is based on our calibration between COLA and GAD¬ 
GET simulations (Fig. [^; 

(ii) compute the typical concentration factor c for mass 
Mpop using Prada et al.| ( [20i2 ), but the relation is given as 
a function of M 2 oo,c; 

(iii) convert the FoF mass halo to M 2 oo,c using 


Lukic et al. 


(2009), which depends on FoF mass and the concentration 


parameter. Their formula also correct for the resolution ef¬ 
fect for small number of halo particles: A 2 oo,c = M 2 oo,c/m, 
where m is the particle mass; 

(iv) start from an initial guess of Mjqq ^ = MyoF, and 
solve steps (ii) and (iii) iteratively for mean concentration 
c, 


= C200,c(M/*) 


*- 200,0 
-^*^^200,0 — 


200,c 


), 


M200.c(MpeP,A«o,„4+^^ 


(28) 

(29) 


which converge quickly within several iterations; 

(v) draw a random concentration parameter, log^Q C 2 oo,c 
from a Gaussian distribution of mean logjQ C 2 oo,c and stan¬ 
dard deviation aiogc = 0.078 ( [Manera et al. pOT3| ); 

(vi) recompute the mass M 2 oo,c using the generated 
C 200 ,c. This determines the halo profile completely, and we 
can compute M 2 oo,m from the profile; 

(vii) draw the number of central and satellite galaxies for 
given HOD parameter using M 2 oo,m; 

(viii) draw satellite positions from the static, spherical 
symmetric NFW profile from the phase-space distribution 
function. The static distribution function is uniquely deter¬ 
mined from the density profile, assuming spherical symme¬ 


try and isotropic velocity distribution (Kazantzidis et al. 


2004). 


We generate mocks for a grid of parameters, and find 
that logic Mmin = 12.92, criogAf = 0.31, log k, Mo = 14.07, 
and P = 1.60, fit the projected correlation function well. 
Since the HOD model contains several free parameters to fit 
the data, our procedure of converting the halo mass is prob¬ 
ably unnecessary. We also tried a concentration parameter 
relation by [Bullock et al.] ( [200ip with no additional scatter, 
but this made little difference. 
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Figure 6. We tune the HOD parameters to match the projected 
correlation functions Wp. The mock galaxy agree with the data 
within the uncertainties. (The error bars for the data are la.) 


In Fig.|^ we plot the projected correlation functions for 
the mock and the data. The solid lines are the mean of 3600 
realisations generated in the periodic box. The log-normal 
HOD without satellite galaxies fits the WiggleZ data well, 
while a small contribution from satellites may improve the 
fit for r ~ 0.7h“^Mpc. The BOSS CMASS galaxies clearly 
require satellite galaxies for r ~ 2h“^Mpc. 


4.3 Box remapping 


We analyse the galaxy sample in three redshift bins, but the 
length along the line of sight is still larger than the box size. 
We rotate the simulation box to fit the volume with mini¬ 


mum overlap, u sing the box remapping technique (Carlson 
& White 2010 1 as a guide. Their publicly available cod^ 
provides a list of possible remappings from a periodic cube 
to cuboids. We use two configurations, which we call \/2 
and %/3, depending on the size of the volume (Table [^. The 
lengths of the remapped cuboid along the line of sight are, 
Li = \/2I/ = 849/i“^Mpc, and L\ = \/3L = 1039/i“^Mpc, 
respectively, where L — 600h“^Mpc is the length of our 
simulation box on a side. In the table, we list the size of the 
cuboid after remapping, and the integer vectors Ui, which 
characterise the remapping. The integer vectors specify the 
orthonormal basis of the remapped coordinate, fii, as fol¬ 
lows: 


ei = ni/|iii| 

62 = n2/|ri2|, U2 = U2 - (Ul • ll2/|ni|^)lli, 

63 = 61 X 62. (30) 


^ http://mwhite.berkeley.edu/BoxRemap/ 


The basis vector 6 i points the line-of-sight, 62 points the 
declination, and 63 points the right ascension directions, re¬ 
spectively, at the centres of the six survey regions. We use 
the cuboid \/3 for which has enough length along 

the line of sight to fit the redshift range 0.2 to 0 . 6 , and use 
y/2 for and Ak^“ when we need a wider cuboid in 

transverse directions. A small fraction of the survey volume 
was larger than the remapped cuboid, and the same volume 
in the simulation box was used twice. The fraction of such 
volume is 1.7 per cent of the total volume. In Table ED we 
list the remapping we use and the fraction of overlap for 
each region. 


4.4 Mock catalogue 

The overall procedure for creating a mock catalogue from a 
halo catalogue is as follows: 

(i) We fill the space with periodic replications of the sim¬ 
ulation box, and rotate the positions and velocities to the 
remapped coordinate using the orthonormal basis (equa¬ 
tion [t5| ) ; 

(ii) apply the redshift space distortion to the halo posi¬ 
tion: 

V ■ X . 

s = x+ — —X, (31) 

ail 

where H is the Hubble parameter at scale factor a, and 
X = xl\x\ is the unit vector parallel to x\ 

(iii) populate the haloes with mock galaxies using the 
HOD (which may depend on the redshift-space position at 
low redshift to match the high number density); 

(iv) subsample the mock galaxies to match the selection 
function (mask) of the survey. The subsample fraction is 
calculated to match the observed number of galaxies as a 
mean. The numbers of mock galaxies fluctuate around the 
observed number. 

For the BOSS mock, we first generate the HOD galaxies 
and then apply the redshift-space distortions including the 
satellite virial velocities. We can interchange the step (ii) and 
(iii) because we use a position independent HOD parameters 
for the BOSS galaxies. In Fig. we plot slices of our WiZ- 
COLA mock catalogues for the 15hr region. 


5 ACCURACY OF HOD GALAXIES 


We test the accuracy of our mocks by comparing the HOD 
galaxies generated from COLA with the HOD galaxies gen¬ 
erated from GADGET N-body simulations. We generate 
HOD galaxies in the periodic simulation box and compute 
the power spectra. We use the HOD parameters described 
in the previous section for the GOLA HOD galaxies, but 
we determine different HOD parameters for the GADGET 
haloes to match the COLA power spectra in real space, be¬ 
cause HOD parameters are free fitting parameters that are 
usually adjusted for the observed galaxies. If we used the 
same HOD parameters and the halo mass relation (equa¬ 
tion 21 1 , we would get about 5 per cent higher galaxy power 


spectrum from GADGET haloes as we see in Section 
but this is not the HOD parameters we would use. We find 
logj^Q Mo = 12.275 for the WiggleZ log-normal HOD (width 
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Table 2. Two box configurations that we use to remap the cubic simulation box to cuboids, which are characterised by three integer 
vectors, Ui < |CarIson &: White|2010| |. Li are the lengths of three sides of the cuboid after remapping. 


Name u\ U 2 us Li L 2 Ls 

[/i“^Mpc] [h“^Mpc] [/i~^Mpc] 

x/2 (1,1,0) (1,0,1) (1,0,0) 848.5 734.8 346.4 

^ (1,1,1) (1,0,0) (0,1,0) 1039.2 489.9 424.3 


redshift z(xi) 


0.2 0.4 0.6 0.4 0.6 0.8 0.6 0.8 1.0 



600 800 1000 1200 1400 1600 1200 1400 1600 1800 2000 1600 1800 2000 2200 2400 


X, [/7^''Mpc] 

Figure 7. One realisation of the mock galaxy catalogues for the 15hr region. The depth of the slices is 50h~^Mpc. The coordinates are 
those of the remapped system, xi = x ■ e.i^ whose origin a; = 0 is the observer. 



Figure 8. HOD galaxy power spectra generated from COLA versus GADGET in real and redshift space. COLA HOD galaxies show 
good agreement with the GADGET HOD galaxies. The horizontal lines in the power spectra ratios are the results of minimum fitting, 
based on the diagonal errors in the ratio from 14 realisations. The uncertainties in the fitting are 95-per cent intervals. 
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Figure 9. The cross- and auto- power spectra of HOD galaxy 
density and line-of-sight peculiar velocity. COLA has accurate 
peculiar velocities. We do not find systematic error in the velocity- 
galaxy cross power for k ^ 0.2/rMpc“^ and in the velocity auto¬ 
power for fc ^ 0.15 hMpc“^; there are errors of about 3—5 per cent 
in the range 0.15hMpc“^ ^ A: ^ 0.5hMpc“^. 


CTiogM is fixed to 0.1), and logj^Q Mmin = 12.92, criog At = 0.37, 
logj^Q Mo = 14.00, and /3 = 1.45 for the BOSS HOD. 

In Fig. 1^ we plot the power spectra in real and red- 
shift space. We compute the monopole {I = 0) and the 
quadrupole {i = 2) moments for the redshift-space power 
spectrum P®, 

P!{k) = {2£+l) j Pi{pi)P‘{k,^L)d^jL, (32) 


where Pt is the Legendre polynomial, and fi — k ■ eo is the 
consine of the angle between the wave vector and the fixed 
direction of the redshift-space distortion, es, which is set to 
the direction of the third axis. The procedure of computing 


the power spectra is the same as that in Section |3.1 


only difference is that we also subtract the shot noise (Jing 


the 


2005). 


In the lower panels, we plot the ratio of the power spec¬ 
tra. Although the HOD galaxies are based on simulations 
with the same initial condition, the ratio of the power spec¬ 
tra is affected by the randomness in populating the haloes 
with galaxies. The error bars are 2g of the mean (equa¬ 
tion 221 based on 14 realisations. The real-space power and 
the redshift-space monopole are very accurate; the ratios are 
consistent with unity for k 0.2 fiMpc”^ within the statisti¬ 
cal fluctuation, and the statistical error is about 1 per cent. 

Since we do not have enough statistics for the 
quadrupole moment for precise comparison, we also com¬ 
pute the cross-power spectra, Pgu, and auto- power spectra, 
Puu, between the galaxy density and the line-of-sight pecu¬ 
liar velocity u = ws, to show the accuracy of the peculiar 
velocities. The redshift-space distortion is an effect of pecu¬ 
liar velocity, and the power spectrum in redshift space, P”, 
is approximately related to the galaxy density and velocity 


power spectra in real space | Scoccimarro||2004 ) 


P‘{k,^) « Pgg{k) + 2kfilmPgu{k, )j,) + {kfif Puuik, ki-)- (33) 

(In the linear limit, the power spectra are proportional to 
the matter power spectrum Pm, via ImPgu = fb^Pm/k, and 

Puu /V" Pm / k ^, respectively, where b is the linear galaxy 

bias and / = dinDi/din a is the linear growth rate.) In 
Fig-i we plot the angle-averaged cross- and auto- power 
spectra, Pgu{k, fi)dfi and Puu{k, fj,)dfj,. We refer the 
reader to our previous paper for technical details I Koda et al. 


2014). The cross power spectra are also accurate with about 


1 per cent scatter, but the velocity-velocity power spectra for 
haloes (BOSS central galaxies and WiggleZ galaxies) have 
about 3 per cent error for k ~ O.l/iMpc”^, and 5 per cent 
error for k ^ 0.2/iMpc“^. The BOSS satellite galaxies add 
additional error due to different virial velocities caused by 
different HOD parameters; this discrepancy of about 10 per 
cent shows that the velocity power spectrum is sensitive to 
HOD parameters, in general, through the non-linear ran¬ 
dom velocities, and is not necessarily a failure of the COLA 
mocks. 

A good agreement in the real-space power spectrum is 
not difficult to achieve by tuning the HOD parameters or 
non-linear biasing models for haloes, but such tuning does 
not usually work simultaneously in redshift space. Faster 
mock generation techniques that uses 2LPT usually have 
about 5 per cent error in the monopole and 10 per cent er¬ 
ror in the quadrupole of the redshift-space power spectrum 


I Chuang et al. 2015b I. The primary advantage of COLA over 


2LPT based methods is the accuracy in the non-linear pecu¬ 
liar velocity, which may be important for the error evalua¬ 
tion of BAO reconstruction, and measurement of the growth 
rate. The accurate peculiar velocity is limited to that for 
haloes, and we do not expect accurate densities or virial ve¬ 
locities inside haloes. We find discrepancies of 10 per cent at 
k = 0.1 /iMpc“^, and 20 per cent at fc = 0.2 fiMpc”^, respec¬ 
tively, in redshift-space power spectra for A^-body particles 
between COLA and GADGET, which seem to be conse¬ 
quences of inaccurate virial velocities inside the haloes. 

Ideally we would like to compare the accuracy of the co- 
variance matrix, since the main purpose of generating multi¬ 
ple realisations of mock catalogues is to compute covariance, 
but we do not have enough GADGET A^-body simulations 
for covariance matrices. We do not have enough realisations 
to compare the two-point correlation function precisely, ei¬ 
ther. We leave these comparisons for future studies. 


6 CONCLUSION 

• We have presented the WiZ-COLA simulation, which 
consists of 3600 simulations with 1296^ particles that covers 
the volume of (600/i“^Mpc)®, and resolve haloes of mass 
10^^/i“^Mpc, using our new parallelized COLA code. The 
simulation took only 200k core hours in total. 

• We generate 600 realisations of mock galaxy catalogues 
for the WiggleZ survey, and the BOSS CMASS galaxies in 
the overlap regions using HODs. We show that COLA can 
create mock HOD galaxies as accurate as GADGET A-body 
simulations for large-scale power spectra for wavelength k ^ 
0.2 IiMpc”^, both in real- and redshift-space. 
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• The accuracy in peculiar velocity is the primary ad¬ 
vantage of COLA simulations. We show that velocity power 
spectra are accurate within a per cent for fc ^ 0.15 hMpc and 
3 per cent for 0.2 hMpc, and we expect a similar accuracy for 
the quadrupole moment of the galaxy power spectra in red- 
shift space. The accuracy of the galaxy-velocity cross-power 
spectra and monopole moment of galaxy power spectra is 
better than 1 per cent. 

• Another benefit of the COLA approach is that the lin¬ 
ear growth rate of matter fluctuation at large scales is deter¬ 
mined much better than 1 per cent. This is not a great ad¬ 
vantage for galaxy survey, because of the few-per-cent error 
in galaxy bias, but could be an advantage for gravitational 
lensing surveys. 
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APPENDIX A: IMPACT OF INITIAL 
CONDITION 

We show the impact of our initial condition, which is given 
at a = 0.5/nstep for the velocity (equation |16[ ), compared to 
the original one at a = 0.1 for both the position and velocity, 

= 0, = 0. (Al) 

This original initial condition gives slightly better results, 
although our initial condition is not problematic in theory. 
The ansatz for COLA with ulpt = —2.5 is tuned for the 
original initial condition at a = 0.1, and the same ansatz is 
probably not optimal for onr initial velocity at a = 0.05. 

In Fig. |A1[ we plot the ratio of the matter power spec¬ 
tra to that of the GADGET A-body simulations at z = 0.6 
for different number of steps with the original initial con¬ 
dition. We divide the time equally in scale factor between 
0 and 1, a(ti) = i/ustsp, for Uatep = 10,20,50, and 100. 
The original initial condition gives better accuracy around 
k — O.l/iMpc”^, without the 2-3-per-cent excess in Eig. 
the agreement is better than 1 per cent for k ^ 0.3 /iMpc“ . 
The range with accurate matter power expands as we in¬ 
crease the number of steps. 

In Eig. |A2[ we plot the accuracy in the halo bias 
and mass. The original initial condition gives a slight im¬ 
provement for the halo bias as well — from 5-per-cent er¬ 
ror in Fig. to about 3 per cent for 10 time steps. We 
split the haloes to groups with an equal number density 
of 10“^(/i~^Mpc)“® by their mass and compute the halo 
bias, as we did for Fig. The halo bias improves to about 
1 per cent for 100 steps. The lower panel shows the mean 
halo mass in each group. Our COLA simulations does not 
converge to the GADGET simulation because we have the 
uniform PM grid for force computation, and that causes an 
additional error in the halo formation independent of time 
steps. The PM force recovers the correct force at a distance 
of about 2.7 times the PM grid size, which corresponds to 
a virial radius of a halo of mass M 2 oo,m = 5 x Mq 

for our conhguration; the limited force resolution below this 
scale explains the deviation from the correct halo mass. 

In this Appendix, we have shown that our excess in our 
matter power spectrum was caused by our initial setup for 
the velocity, and the accuracy of GOLA simulations could 
improve slightly by using the original initial condition. 



k [hMpc '] 

Figure Al. The matter power spectrum with the original ini¬ 
tial condition, which gives slightly more accurate power spectrum 
than Fig. 1^ 
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Figure A2. The precision of COLA halo bias (Upper panel) and 
halo mass {Lower panel) for various time steps. The original ini¬ 
tial condition gives slightly better biases than Fig[^ The accuracy 
become about 1 per cent for 100 steps, while the halo masses do 
not show monotonic convergence. 
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Table Al. We list the number of galaxies, -/V^iggiez? the mean numbers of mock galaxies and their standard error in the mean for 
3600 realisations, NwiggieZi the survey volume in units of 10^(h~^Mpc)^, for the six regions in the sky decomposed to 3 redshift 
bins. The cuboid is one of the box remappings listed in Table The ’overlap’ is the fraction of the survey volume that overlaps in 
the periodic simulation box in per cent — the overlapped volume consists of two copies of the same simulation volume. Since 
completely overlaps with the other two redshift bins, the total, in the final row, is the sum for Near and Far redshift bins. 


reg 

Az 

A^WiggleZ 

-^^ViZ-COLA 

volume (10^) 

cuboid 

overlap 

1 hr 

Near 

6927 

6927.63 ± 3.3 

2.81 

Vs 

0 

1 hr 

Mid 

9437 

9436.5 ± 3.4 

4.98 

Vs 

0 

1 hr 

Far 

7880 

7882.2 ±3.1 

7.12 

Vs 

0 

3 hr 

Near 

8000 

8000.3 ± 3.6 

2.89 

Vs 

0 

3 hr 

Mid 

10241 

10240.7 ± 3.6 

5.12 

Vs 

0 

3 hr 

Far 

8756 

8760.0 ±3.1 

7.33 

Vs 

0 

9 hr 

Near 

15128 

15131.0 ±5.0 

4.82 

Vs 

0 

9 hr 

Mid 

18978 

18984.0 ±5.1 

8.53 

Vs 

0 

9 hr 

Far 

11424 

11418.6 ±3.4 

12.20 

V2 

0.58 

11 hr 

Near 

18019 

18020.1 ± 5.1 

6.25 

Vs 

0 

11 hr 

Mid 

22289 

22299.2 ± 4.8 

11.07 

V2 

5.08 

11 hr 

Far 

13919 

13894.9 ± 3.3 

15.84 

V2 

1.73 

15 hr 

Near 

22309 

22312.3 ±6.1 

7.12 

Vs 

0 

15 hr 

Mid 

30015 

30024.6 ±6.1 

12.62 

V2 

4.88 

15 hr 

Far 

19471 

19428.3 ±4.4 

18.05 

V2 

5.66 

22 hr 

Near 

15884 

15883.6 ±6.5 

3.55 

Vs 

0 

22 hr 

Mid 

16146 

16142.7 ± 5.4 

6.29 

Vs 

0 

22 hr 

Far 

11024 

11025.9 ±3.8 

9.00 

Vs 

0 

Total 


158741 


97.00 


1.7 
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